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We suggest two classes of multivariate GARCH-models which are both easy to estimate and 
perform well in forecasting the covariance matrix of more than one hundred stocks. We apply 
methods from random matrix theory (RMT) to determine the number of principal components or 
the number of factors in the multivariate volatility models. In this way only statistically relevant 
information is used for the estimation of model parameters. 
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The basic input for the estimation of portfolio risk is 
a forecast of time varying covariance matrices. Although 
modern risk management is founded on the analysis of 
huge data bases and the use of sophisticated theoret- 
ical models, there has been only partial progress with 
respect to covariance forecasts. The reason for this lack 
of progress is the "curse of dimensionality" : in modern 
financial engineering, the dimension of the investment 
universe is comparable to the number of observations, 
and hence the number of parameters in the covariance or 
correlation matrix is of the same order as the number of 
data points. 

Traditional multivariate statistics is not able to deal 
with this kind of problem. Both estimators and tests 
are consistent only in the limit of L/N going to infin- 
ity, where L is the number of observations and N is the 
dimension of the observed vector . In typical applica- 
tions, one is dealing with ratios L/N ranging from one 
to ten, and hence statistical fluctuations due to the lack 
of a sufficient number of observations dominate an em- 
pirically observed covariance matrix. This measurement 
noise induces randomness in the covariances and hence 
turns the covariance matrix at least partly in a random 
matrix. 

A covariance matrix can be decomposed into the cross- 
correlation matrix C and the set of standard deviations 
{erf}. To identify the effects of randomness on the eigen- 
value spectrum of an empirical cross-correlation matrix, 
one considers the null hypothesis of completely uncorre- 
cted and identically distributed (i.i.d.) time series. The 
cross-correlation matrix calculated from these time se- 
ries is a random matrix R from the so called Wishart 
ensemble. The eigenvalues of a random Wishart matrix 
fall in the interval [A_, A+] with A± = 1 + N/L + 2 V / N/L 
. This eigenvalue distribution can be compared to that 
of the empirical cross-correlation matrix C. Agreement 
between R and C is a sign of randomness, whereas devi- 
ations indicate the presence of economically meaningful 
information. It was found that approximately 94 % of 
the empirically observed eigenvalues fall in the random 
matrix interval |3j. The random matrix nature of the 
main part of the spectrum was proven by comparing the 
eigenvalue statistics of empirical correlation matrices to 
the universal predictions of random matrix theory £j. 
Only the largest eigenvalues and corresponding eigenvec- 
tors carry information about market correlations, busi- 



ness sectors and geographical correlations |5j, |fj . These 
economically meaningful correlations are stable in time 
and hence suitable for forecasting future correlations. 
Generally, correlations show a higher degree of time sta- 
bility than covariances which are fluctuating due to time 
changing volatility. 

The time evolution of the variance or volatility is 
often described with GARCH-models 0, which relate 
the present variance to past variances and past returns. 
While GARCH-processes are well suited to describe the 
dynamics of a univariate volatility process they are 
less successful in the multivariate setting. In addition 
to the "curse of dimensionality" , the application of 
multivariate GARCH-models is hampered by the need 
to make maximum likelihood estimates of a large number 
of parameters growing generically like the square of 
the number of time series. In this note, we suggest 
two classes of multivariate GARCH-models, which 
incorporate RMT based progress in the understanding of 
empirical cross-correlation matrices. These models are 
both easy to estimate and compare favorably to alterna- 
tive models as the RiskMetrics covariance estimator 0. 

High dimensional correlation matrices 

We estimate the sample covariance matrix 
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and define the sample correlation matrix = 
Yiij / (<Ji<jj) by normalizing with the standard deviations 
a, = ySii- We calculate cross-correlation matrices 
for the 118 most frequently traded German companies 
from a) 1711 daily returns e,,t starting 12/01/93 until 
08/31/00, and b) 250 daily returns starting 09/01/99 
until 08/31/00, diagonalize them, and rank-order their 
eigenvalues A^ < Xk for i < k. The pdf of the eigenval- 
ues of C a is displayed in Fig. ^ F° r both C a and Cb we 
find that three eigenvalues lie outside the random ma- 
trix interval, but only one of them clearly. The number 
of eigenvalues clearly outside the RMT-interval is the 
optimal number of principal components or factors that 
should be used to forecast the correlation matrix [Tl| . 

The largest eigenvalue A max = 14.5 for the seven-year 
interval is much larger than A max = 7.1 for the one- 
year interval. This confirms the result of 0, 0, 0] that 
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FIG. 1: Probability distribution of the eigenvalues of the 
cross-correlation matrix C a calculated from daily 118 return 
time series from 09/01/99 until 08/31/00 (gray line) . The 
bulk of eigenvalues is due to noise and well described by the 
prediction of random matrix theory (black line). One eigen- 
value is clearly separated from the bulk and contains infor- 
mation about market correlations. 

the strength of market correlations changes in time. Al- 
though the use of time series extending over several years 
in theory increases the accuracy of correlation estimates, 
it does not capture the effect of time changing correla- 
tions and hence is not advisable. 

In order to use only the statistically relevant informa- 
tion in the correlation matrix C, we diagonalize C by an 
orthogonal transformation X via X T CX = A. We con- 
struct a filtered correlation matrix by keeping only 
the p largest eigenvalues in the diagonal matrix [n| 

.p _ J Aji for i > N - p ,„s 
« \ for i < N - p ' [ 1 

and by transforming back to the original basis 
C p = XA P X T . For consistency, all diagonal ele- 
ments of C p are set to one. 

Model description 

The basis of the Sliding Correlation Model (SCM) 
is the decomposition of the covariance matrix into the 
cross-correlation matrix C and standard deviations {o~i}. 
Each individual time series is described by a univariate 
GARCH(1,1) process defined by 

°i,t = a h0 + ai,i°f,t-i + &A e i,t-i • ( 3 ) 

Here, af t and e^ t are the volatility and the innovation 
of time series i at time t, respectively. The innovations 
are modelled as a product e^t = a^t ■ x^t with normally 
distributed x € A/"(0, 1). The parameters a^o, a^i, and 
Pi t x are obtained from maximum likelihood estimations. 
The cross-correlation matrix C is calculated as a moving 
average to accommodate the change of correlations in 
time. We use the filtering method described in the last 
section and keep only the p largest principal components. 
The covariance matrix in the sliding correlation model 
SCM(p) of rank p is described by 

= C ij,t a i,t Vj,t ■ (4) 



This model is related to the constant correlation model 
|l2j |. where a fixed and unfiltered correlation matrix is 
used in contrast to our Cfj t . 

Factor Models are motivated by the observation that 
correlations are large in times of high volatility [(|. In 
addition, general multivariate GARCH-models can be 
simplified to factor models, if the parameter matrices do 
not have full rank. As a special case with parameter 
matrices of rank one, a one-factor model was discussed 
in ^3] . We suggest a more general model with p factors 
(abbreviated Factor(p)) 

fe=i 

Here, af t is the variance of the residual of time series i 
after linear regression on the p factors / t with k = l...p, 
and \\ k ' is the regression coefficient (factor loading) of 
factor k on time series i. The dynamics of the residuals is 
described by a GARCH(l,l)-process, and the dynamics 
of the factors is similarly given by 

sf^af'+^EW+af'^) 2 . (6) 

The factors / t (fc) are defined via an iterative procedure: 
we use the largest eigenvector of C to define = 
J2iLi u i, N ^ e i,t- Next, we perform a linear regression of 
on the ti and recompute the correlation matrix from 
the residuals. From the new eigenvector corresponding 
to the largest eigenvalue, we define a factor f^ 2 \ per- 
form another linear regression and so on. The number p 
of factors is chosen according to the number of eigenval- 
ues of the correlation matrix outside the RMT-interval, 
the time interval for the calculation of factors and factor 
loadings is not moving but fixed. 

The data set for parameter estimation comprises seven 
years of daily data from 12/01/93 until 08/31/00, which 
are used for the maximum likelihood estimation of all 
GARCH parameters, for the calculation of factors and 
factor loadings of the models Factor(p)-seven, whereas 
only the last year of daily data is used for models 
Factor(p)-one. Similarly, the cross-correlation matrix 
of model SCM(p)-seven is calculated over a sliding time 
window of length seven year, and a window of length one 
year is used for model SCM(p)-one. 

Our benchmark model is the commonly used Risk Met- 
rics covariance estimator defined by 

Etf,t = 0.94 Sjj jt _! + (MM!,..,,,., . (7) 
Test method 

We judge the quality of a multivariate volatility model by 
its ability to forecast the daily variance of both equally 
distributed and minimum variance portfolios. The pre- 
dicted portfolio volatility at time t is given by 

N 

^predicted = X/ mi ' t m ^'* ' 
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FIG. 2: Prediction of daily volatility of a), b) equally 
weighted portfolios and c), d) minimum variance portfolios. 
The black line shows the estimated volatility, the thin grey 
line the realized volatility calculated from high frequency data 
while the thick grey line is an seven-day average of the realized 
volatility, a) Sliding correlation model SCM(l)-seven, which 
uses one principal component and is estimated over a seven- 
year period, overestimates the variance by 65 % due to an 
overestimation of the average correlation strength, b) Model 
Factor(l)-seven, which uses one factor and is estimated over 
a seven-year period is able to describe changes in the corre- 
lation strength but tends to overestimate correlations in the 
time after a volatility peak, c) RiskMetrics estimator Eq. |7| 
predicts a very small variance for a minimum variance portfo- 
lio, while the realized variance is a factor 35 larger, d) Sliding 
correlation model SCM(l)-one uses one principal component 
and estimates correlations over a sliding time window of one 
year. The predicted variance is close to the realized one, which 
is more than 50 % smaller than for the RiskMetrics estimator. 



Here, m^t is the fraction of the capital invested in stock 
i at time t. For an equally distributed portfolio, we have 
rrii t t = 1/N, hence D 2 is just the average element of the 
covariance matrix. This test probes the ability of a model 
to correctly predict the average covariance between pairs 
of stock. On the other hand, 
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minimize the variance under the constraint that the total 
invested money is equal to one. The volatility predic- 
tion for minimum variance portfolios is a difficult test for 
volatility models, as the covariance matrix is first used 
to calculate the portfolio weights and then to estimate 
the variance of that portfolio. We compare the predicted 



volatility to the realized volatility, which is calculated 
from high frequency (hourly) data [8j . The realized port- 
folio variance at time t is calculated as 
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m ht m jtt ((g ilt 9j,t) ~ (9i,t){9j,t)) (10) 



where the expectation values are taken for 10 hourly 
returns gi per trading day. We perform an out of sample 
test from 09/01/00 until 08/31/01. We calculated the 
average predicted portfolio variance, the average realized 
variance, the mean square error (MSE) of the prediction 
and the mean average percentage error (MAPE). 

Test results: 

For equally distributed portfolios, the performance of the 
various factor models is very similar to each other. Model 
Factor (l)-seven, for example, overestimates the overall 
level of variance but is able to describe the effect of en- 
hanced correlations in high volatility periods (Fig. |2J. 

The variance estimation of SCM(N)-seven and 
SCM(l)-seven is very similar to each other, both mod- 
els severely overestimate the average level of correlations, 
see Fig. [21 for the results of SCM(l)-seven. The sliding 
correlation models with a one year time window for corre- 
lation estimates provide for the best estimates of realized 
variances, the RiskMetrics estimator shows comparable 
results. These results indicate that a good multivariate 
volatility model must be able to capture not only the 
change of volatility over time but also the change of the 
average correlation strength. For details see Table |U 

Minimum variance portfolio: Due to the measurement 
noise contained in its cross-correlation matrix, the model 
SCM(N)-one clearly underestimates the variance of mini- 
mum variance portfolios, the ratio of realized variance to 



4 



Model 


n 2 

-^estimated 


n 2 

-'-'realized 


MSE 


MAPE 


SCM(l)-seven 


0.654 


0.396 


0.092 


0.886 


SCM(2)-seven 


0.666 


0.396 


0.098 


0.916 


SCM(N)-seven 


0.643 


0.396 


0.086 


0.856 


SCM(l)-onc 


0.497 


0.396 


0.037 


0.486 


SCM(2)-one 


0.500 


0.396 


0.037 


0.493 


SCM(N)-one 


0.484 


0.396 


0.034 


0.456 


Factor(l)-seven 


0.568 


0.396 


0.063 


0.615 


Factor(2)-seven 


0.576 


0.396 


0.065 


0.634 


Factor(l)-one 


0.571 


0.396 


0.062 


0.618 


Factor(2)-one 


0.575 


0.396 


0.064 


0.629 


RiskMetrics 


0.519 


0.396 


0.057 


0.535 



TABLE I: Estimation of the variance of equally distributed 
portfolios 



Model 


n 2 

^estimated 


n 2 

-'-'realized 


MSE 


MAPE 


SCM(l)-seven 


0.177 


0.243 


0.022 


0.335 


SCM(2)-seven 


0.188 


0.238 


0.025 


0.415 


SCM(N)-seven 


0.190 


0.245 


0.029 


0.409 


SCM(l)-one 


0.158 


0.208 


0.016 


0.340 


SCM(2)-one 


0.157 


0.208 


0.017 


0.338 


SCM(N)-one 


0.128 


0.284 


0.046 


0.479 


Factor(l)-seven 


0.215 


0.304 


0.033 


0.318 


Factor(2)-seven 


0.238 


0.303 


0.035 


0.375 


Factor(l)-one 


0.212 


0.300 


0.031 


0.319 


Factor(2)-one 


0.220 


0.312 


0.033 


0.305 


RiskMetrics 


0.013 


0.453 


0.241 


0.967 



TABLE II: Estimation of the variance of minimum variance 
portfolios 



predicted variance is 2.22. The filtered model SCM(l)- 
one performs much better in this respect, the ratio is 
given by 1.32 (Fig. 2d). In addition, the realized vari- 
ance of the "filtered" model is 27% lower than that of the 
original model. Adding one more eigenvalue in the cross- 
correlation matrix does not change this result much. The 
performance of the various factor models with one or two 
factors and estimation periods of seven or one year is 
quite similar to each other, therefore we discuss the rep- 
resentative model Factor (l)-seven (detailed results can 
be found in Table IXTf) . Although the ratio of realized to 
predicted variance is with 1.46 better than that of model 
SCM(N)-one, the realized variance itself is higher than 
that of the other models. Possibly the performance of 
the factor models could be increased if the factor load- 
ings were estimated by the maximum likelihood method 
instead of using the regression coefficients. Details of 
the results can be found in Table [H] All these results 
compare favorably to the RiskMetrics covariance esti- 
mator, which underestimates the portfolio variance by 
a factor of 35 (Fig. 2c). Also, the results reported in [14J 
for DCC-GARCH (overestimation of risk by several hun- 
dred percent) are much less precise than the estimates of 
SCM(l)-one, for example. 

In summary, we have shown that the "curse of dimen- 
sionality" can be overcome by using appropriate statis- 
tical tools relying on RMT. Building on the distinction 
between statistically relevant information and noise in 
the correlation matrix, we have suggested two types of 
multivariate GARCH-models that are easy to estimate 
even for more than 100 time series. Both the sliding 
correlation model and the factor model studied have a 
much better prediction quality than the industry stan- 
dard RiskMetrics covariance estimator. 
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